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OIPE 



RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/983,000 
Input Set : A:\es.txt 

Output Set: N:\CRF3\11022001\I983000.raw 



DATE : 11/02/2001 
TIME : 15:30:46 



Does Not Comply 
Corrected Diskette Needed 



3 <110> APPLICANT: AGY Therapeutics 

4 Melcher, Thorsten 

5 Mueller, Sabine 

6 Chin, Daniel 

8 <120^ TITLE OF INVENTION: USE OF PROTEIN BIOMOLECULAR TARGETS IN THE TREATMENT AND 
VISUALIZATION OF 

9 BRAIN TUMORS 

11 <130> FILE REFERENCE: 263/180 PEagleman -- AGY 

C--> 13 < 1 4 0 > CURRENT APPLICATION NUMBER: US/09/98 3, 000 

C--> 13 <141> CURRENT FILING DATE: 2001-10-17 

13 <160> NUMBER OF SEQ ID NOS : 35 

15 <170> SOFTWARE: Patentln version 3.1 



ERRORED SEQUENCES 




The type of erron shown #xtst throughout 
the Sequence Listing. Please check subsequent 
sequences for similar errors. 




<210> SEQ ID NO: 23 
<211> LENGTH: 3143 
<212> TYPE: DNA 
<213> ORGANISM: Homo sapiens 
<220> FEATURE: 
<221> NAME/KEY: REFERENCE 
<222> LOCATION^ (1). . (3143) 

<223> OTHER INFORMATION: Crosby, A.H., Edwards, S.J., Murray, J.C. and 

enomic organization of the human osteopontin gene: exclusion of 
he locus from a causative role in the pathogenesis of dentinoge 
sis imperfecta type II JOURNAL Genomics 27 (1 ), 155-160 (1995 ) 



FEATURE: — 

NAME/KEY: Gene 

LOCATION: (1) . . (314 3) 

OTHER INFORMATION: Osteopontin 

FEATURE : 

NAME/KEY: CDS 

LOCATION: ( 374 ) . . ( 427 ) 

OTHER INFORMATION: 

FEATURE : 

NAME/KEY: CDS 

LOCATION: (537) . . (575) 

OTHER INFORMATION: 

FEATURE : 

NAMF / K Trv ■ ■"^o 




6 2 50 < .2 22;* LOCATION: (1232).. (12 7 3) 
6251 <:223> OTHER INFORMATION: 



file: C: C Rf : 3 Outhold VsrI983(XX).htm 
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6254 <220> FEATURE: 



C: ( Rf ; 3 OutholiJ \ srl 983(KK).htni 



II 2 01 
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RAW SEQUENCE LISTING ' DATE: 11/02/2001 

PATENT APPLICATION: US/09/983 , 000 TIME: 15:30:47 

Input Set : A:\es.txt 

Output Set: N:\CRF3\11022001\I983000.raw 



6255 <221> NAME/KEY: CDS 

6256 <222> LOCATION: ( 15 97 ) . . ( 1920 ) 

6257 <223> OTHER INFORMATION: 

6260 <220> FEATURE: 

6261 <221> NAME/KEY: CDS 

6262 <222> LOCATION: ( 2 3 0 5 ) . . ( 2 7 0 9 ) ^xiO^-f^c^ I 

6263 <223> OTHER INFORMATION: . v. , i _ no 
E --> <^6^5^T^^ » iW ^ 15 

6267 ggggaagtgt gggagcaggt gggctgggca gtggcagaaa cctgatgaca caatctcgcc 60 
6269 gcctccctgt gttggtggag gatgtctgca gcagcattta aattctggga gggcttggtt 120 
6271 gtcagcagca gcaggaggag gcagagacag catcgtcggg accagactcg tctcaggcca 180 
6273 gttgcagcct tctcagccaa acgccgacca aggtacagct tcagtttgct actgggttgt 240 
6275 gcattcaqct qaatttcatg gggaagtcca aattctaagg aaaaaaatgt ggtagtataa 300 
6277 aaaggtatca ctgttgtaac ctatgaagat gtcagctatt cctttgaaat attttgcagg 3bu 

6279 aaaactcact acc atg aga att gca gtg att tgc ttt tgc etc eta ggc 409 

6280 Met Arg lie Ala Val lie Cys Phe Cys Leu Leu Gly 

6281 15 10 

6283 ate acc tgt gec ata cca gtgagtacag ttgeatctta aagaaaattc 457 

6284 He Thr Cys Ala He Pro 

6285 15 

6287 ctgaaaataa ctgaattgtg tgcttccatg tgctaggagg acattcttgt aatctttctt 517 

6289 catcttttct gtttctaag gtt aaa cag get gat tct gga agt tct gag gaa 569 

6290 Val Lys Gin Ala Asp Ser Gly Ser Ser Glu Glu 

6291 20 25 

6293 aag cag gtaagcatct tttatgtttt tatatagtta aatcatttac tcaattatgg 625 

6294 Lys Gin 

6295 30 

6297 cgagaggtgc aagaaacgta tttgetgega tcaaatgagt tcatatttgt aaagcaattt 685 
6299 gaaagagtgc ctagcccaca gtaagtgcta cataagagtt tgttaaatga atetgeaaaa 745 
6301 aaaaaaaaaa ttacaaaaag gtacctaagg gtccgggtga etatatgett ccatcaagac 805 
6303 tagtgaagaa tggttgtttt ttccattcat ccctacattt ctttttttaa taatgataaa 865 

6305 catgeaaett ttttgtag ctt tac aac aaa tac cca gat get gtg gec aca 916 

6306 Leu Tyr Asn Lys Tyr Pro Asp Ala Val Ala Thr 

6307 35 40 

6309 tgg eta aac cct gac cca tct cag aag cag aat etc eta gec cca cag 964 

6310 Trp Leu Asn Pro Asp Pro Ser Gin Lys Gin Asn Leu Leu Ala Pro Gin 

6311 45 50 55 

6313 gtatttttaa acttctcata attaaactac agtgatgaaa gatagecaca ctcaggccat 1024 
6315 ttgggctgct cagatgaatc ctgccctgcc tgctggcaaa catgtgctta ggacattgac 1084 
6317 tgatctgeca tgttggcttc tctctgtgtt aagccatcca cagatgaggc tgaaaaataa 1144 
6319 aaactgcttt ggattaaaaa ggttaacttt tgaataaaaa agctaggcat gtgtgatgcg 1204 

6321 cactaacacg tgccattcct tcttcag aat get gtg tec tct gaa gaa acc aat 1258 

6322 Asn Ala Val Ser Ser Glu Glu Thr Asn 

6323 60 65 

6325 gac ttt aaa caa gag gtaaqttctc attttcaatc agaqqeccat catgecttga 1313 
6 3 26 Asr> Phe Lvs Gin Glu 
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RAW SEQUENCE LISTING DATE: 11/02/2001 

PATENT APPLICATION: US/09/98 3 , 000 TIME: 15:30:47 



Input Set : A:\es.txt 

Output Set: N:\CRE3\11022001\I983000.raw 



6333 


aatactaggt 


ttccteggat aacctaaaag ccatggtatg 


tactgtgaat 


gcaaagattc 


1493 


6335 


tgaaactaaa 


taaaaagaaa gatagtaaaa gactaatgtg 


ctataaaggc 


taagggaaaa 


1553 


6337 


taaaaaccca 


tatattaatt ttcccggcca 


l tcttaatttt 


cag 


acc 


ctt 


cca 


agt 


1608 


6338 


























Thr 


Leu 


Pro 


Ser 




6339 






























75 






6341 


aag 


tec 


aac 


gaa 


age 


cat 


gac 


cac 


atg 


gat 


gat 


atg 


gat 


gat 


gaa 


gat 


1656 


6342 


Lys 


Ser 


Asn 


Glu 


Ser 


His 


Asp 


His 


Met 


Asp 


Asp 


Met 


Asp 


Asp 


Glu 


Asp 




6343 








80 










85 










90 








6345 


gat 


gat 


gac 


cat 


gtg 


gac 


age 


cag 


gac 


tec 


att 


gac 


teg 


aac 


gac 


tct 


1704 


6346 


Asp 


Asp 


Asp 


His 


Val 


Asp 


Ser 


Gin 


Asp 


Ser 


He 


Asp 


Ser 


Asn 


Asp 


Ser 




6347 






95 










100 










105 










6349 


gat 


gat 


gta 


gat 


gac 


act 


gat 


gat 


tct 


cac 


cag 


tct 


gat 


gag 


tct 


cac 


1752 


6350 


Asp 


Asp 


Val 


Asp 


Asp 


Thr 


Asp 


Asp 


Ser 


His 


Gin 


Ser 


Asp 


Glu 


Ser 


His 




6351 




110 










115 










120 












6353 


cat 


tct 


gat 


gaa 


tct 


gat 


gaa 


ctg 


gtc 


act 


gat 


ttt 


ccc 


acg 


gac 


ctg 


1800 


6354 


His 


Ser 


Asp 


Glu 


Ser 


Asp 


Glu 


Leu 


Val 


Thr 


Asp 


Phe 


Pro 


Thr 


Asp 


Leu 




6355 


125 










130 










135 










140 




6357 


cca 


gca 


acc 


gaa 


gtt 


ttc 


act 


cca 


gtt 


gtc 


ccc 


aca 


gta 


gac 


aca 


tat 


1848 


6358 


Pro 


Ala 


Thr 


Glu 


Val 


Phe 


Thr 


Pro 


Val 


Val 


Pro 


Thr 


Val 


Asp 


Thr 


Tyr 




6359 










145 










150 










155 






6361 


gat 


ggc 


cga 


ggt 


gat 


agt 


gtg 


gtt 


tat 


gga 


ctg 


agg 


tea 


aaa 


tct 


aag 


1896 


6362 


Asp 


Gly 


Arg 


Gly 


Asp 


Ser 


Val 


Val 


Tyr 


Gly 


Leu 


Arg 


Ser 


Lys 


Ser 


Lys 




6363 








160 










165 










170 








6365 


aag 


ttt 


cgc 


aga 


cct 


gac 


ate 


cag 


gtaaatcctt taacagacac acctgatggt 


1950 


6366 


Lys 


Phe 


Arg 


Arg 


Pro 


Asp 


He 


Gin 




















6367 






175 










180 




















6369 


tctgactagc 


gctcaagtct aggaaaccac 


agtttgcata 


ttcattcatt 


cattcatcca 


2010 


6371 


ttcattcatc 


cattcagcaa gaattcattc 


atattctact 


ttatgaccat 


tgaatacaaa 


2070 


6373 


tctttttctg 


cttggcggtt tttgtaagtc 


tacataattt 


ctctctagat 


ttgattctca 


2130 


6375 


aacacaattc 


tactttttga aatcctggat 


caaagtaaca 


tgctagtatt 


atttcageca 


2190 


6377 


gatttagaca 


atttttagta taagatgacc 


taaaagctag 


agagtggaaa 


aggattacca 


2250 


6379 


tattcccatc 


cctagccgtt catataatta 


ttcttcattt 


gtgccgtgat 


tcag 


tac 


2307 


6380 
































Tyr 




6383 


cct 


gat 


get 


aca 


gac 


gag 


gac 


ate 


acc 


tea 


cac 


atg 


gaa 


age 


gag 


gag 


2355 


6384 


Pro 


Asp 


Ala 


Thr 


Asp 


Glu 


Asp 


He 


Thr 


Ser 


His 


Met 


Glu 


Ser 


Glu 


Glu 




6385 








185 










190 










195 








6387 


ttq 


aat 


ggt 


qca 


tac 


aag 


gec 


ate 


ccc 


gtt 


gec 


cag 


gac 


ctg 


aac 


qcq 


2 4 0 3 


6388 


Leu 


Asn 


Gly 


Ala 


Tyr 


Lys 


Ala 


He 


Pro 


Val 


Ala 


Gin 


Asp 


Leu 


Asn 


Ala 




6 389 






200 










205 










210 










6 391 


cct 


tct 


gat 


tgg 


gac 


age 


cgt 


ggg 


aag 


gac 


agt 


tat 


gaa 


acg 


agt 


cag 


2451 


6392 


Pro 


Ser 


Asp 


Trp 


Asp 


Ser 


Arg 


Gly 


Lys 


Asp 


Ser 


Tyr 


Glu 


Thr 


Ser 


Gin 




6393 




215 










220 










225 












6395 


ctg 


gat 


gac 


cag 


agt 


get 


gaa 


acc 


cac 


age 


cac 


aag 


cag 


tec 


aga 


tta 


24 99 


6396 


Leu 


Asp 


Asp 


Gin 


Ser 


Ala 


Glu 


Thr 


His 


Ser 


His 


Lys 


Gin 


Ser 


Arg 


Leu 




6397 


2 30 










235 










240 










245 




6399 


tat 


aaq 


egg 


aaa 


qcc 


aat 


gat 


gag 


age 


aat 


qaq 


cat 


tec 


gat 


gtg 


att 


2547 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/98 3,000 



DATE: 11/02/2001 
TIME: 15:30:47 



E-<T_6580 



Input Set : A:\es.txt 

Output Set: N:\CRF3\11022001\I983000.raw 

6404 Asp Ser Gin Glu Leu Ser Lys Val Ser Arg Glu Phe His Ser His Giu 

6405 265 270 275 

6407 ttt cac age cat gaa gat atg ctg gtt gta gac ccc aaa agt aag gaa 

6408 Phe His Ser His Glu Asp Met Leu Val Val Asp Pro Lys Ser Lys Glu 

6409 280 285 290 

6411 gaa gat aaa cac ctg aaa ttt cgt att tct cat gaa tta gat agt gca 

6412 Glu Asp Lys His Leu Lys Phe Arg lie Ser His Glu Leu Asp Ser Ala 

6413 295 300 305 

6415 tct tct gag gtc aat taa aaggagaaaa aatacaattt ctcactttgc 

6416 Ser Ser Glu Val Asn 

6417 310 

6419 atttagtcaa aagaaaaaat gctttatagc aaaatgaaag agaacatgaa atgcttcttt 
6421 ctcaqtttat tggttgaa-tg tgtatctatt tgagtctgga aataactaat gtgtttgata 
6423 attagtttag tttgtggctt catggaaact ccctgtaaac aaaagcttca gggttatgtc 
6425 tatgttcatt ctatagaaga aatgeaaact atcactgtat tttaatattt gttattctct 
6427 catgaataga aatttatgta gaagcaaaca aaatactttt acccacttaa aaagagaata 
6429 taacatttta tgtcactata atcttttgtt ttttaagtta gtgtatattt tgttgtgatt 
6431 atcttttgtg gtgtgaataa atcttttatc ttgaatgtaa taag 

6569 <210> SEQ ID NO: 25 

6570 <211> LENGTH: 259 

6571 <212> TYPE: PRT 

6572 <213> ORGANISM: Homo sapiens 

6574 <220> FEATURE: 

6575 <221> NAME/KEY: Gene 

6576 <222> LOCATION: (1)..(259) 

6577 <223> OTHER INFORMATION: Carbonic Anhydjrase domain of human carbonic anhydrase III 

"<4^To>~^QUENCL T r~Tr) ^ 4fj e^r€ p^e^l- 

Ala Lys Glu Trp (5Iy Tyr Ala Ser His Asn Gly Pro Asp His Trp His 
6583 15 10 15 

6586 Glu Leu Phe Pro Asn Ala Lys Gly Glu Asn Gin Ser Pro lie Glu Leu 

6587 20 25 30 



2643 



J691 



2739 



2799 
2859 
2919 
2979 
3039 
3099 
3143 



6590 


His 


Thr 


Lys 


Asp 


He 


Arg 


His 


Asp 


Pro 


Ser 


Leu 


Gin 


Pro 


Trp 


Ser 


Val 


6591 






35 










40 










45 








6594 


Ser 


Tyr 


Asp 


Gly 


Gly 


Ser 


Ala 


Lys 


Thr 


He 


Leu 


Asn 


Asn 


Gly 


Lys 


Thr 


6595 




50 










55 










60 










6 598 


Cys 


Arg 


Val 


Val 


Phe 


Asp 


Asp 


Thr 


Tyr 


Asp 


Arg 


Sor 


Met. 


Leu 


Arq 


Gly 


6 5 9 9 


6 5 










70 










75 










80 


6 6 0 2 


Gly 


Pro 


Leu 


Pro 


Gly 


Pro 


Tyr 


Arg 


Leu 


Arq 


Gin 


Phe 


His 


Leu 


His 


Trp 


6 603 










85 










90 










95 




6606 


G 1 y 


Ser 


Ser 


Asp 


Asp 


His 


Gly 


Ser 


Glu 


His 


Thr 


Val 


Asp 


Gly 


Val 


Lys 


6607 








100 










105 










110 






6610 


Tyr 


Ala 


Ala 


Glu 


Leu 


His 


Leu 


Val 


His 


Trp 


Asn 


Pro 


Lys 


Tyr 


Asn 


Thr 


6611 






115 










120 










125 








6614 


Phe 


Lys 


Glu 


Ala 


Leu 


Lys 


Gin 


Arg 


Asp 


Gly 


He 


Ala 


Val 


He 


Gly 


He 


6 615 




130 










135 










140 










6618 




Leu 


Lys 


T le 


Gly 


His 


Glu 


Asn 


G 1 y 


G 1 u 


Phe 


Gl n 


lie 


Phe 


Leu 


Asp 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/98 3 , 000 



DATE: 11/02/2001 
TIME: 15:30:47 



Input Set : A:\es.txt 

Output Set: N:\CRF3\11022001\I98 3000.raw 



6626 Phe Asp Pro Ser Cys Leu Phe Pro Ala Cys Arg Asp Tyr Trp Thr Tyr 

6627 180 185 190 

6630 Gin Gly Ser Phe Thr Thr Pro Pro Cys Glu Glu Cys lie Val Trp Leu 

6631 195 200 205 

6634 Leu Leu Lys Glu Pro Met Thr Val Ser Ser Asp Gin Met Ala Lys Leu 

6635 210 215 220 

6638 Arg Ser Leu Leu Ser Ser Ala Glu Asn Glu Pro Pro Val Pro Leu Val 

6639 225 230 235 240 

6642 Ser Asn Trp Arg Pro Pro Gin Pro lie Asn Asn Arg Val Val Arg Ala 

6643 245 250 255 
6646 Ser Phe Lys 
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VERIFICATION SUMMARY DATE: 11/02/2001 

PATENT APPLICATION: US/09/9 8 3,000 TIME: 15:30:48 



Input Set : A:\es.txt 

Output Set: N:\CRF3\11022001\I983000.raw 



L 


13 M 


270 C 


: Current Application 


Number differs, Replaced Current Application No 


L 


13 M 


271 C 


: Current Filing Date 


differs, Replaced Current Filing Date 






L 


1900 


M 


257 


W 


Feature 


value mis 


-spelled or invalid, 


v221> Name/Key for SEQ 


ID# : 


6 


L 


1915 


M 


257 


W 


Feature 


value mis 


-spelled or invalid, 


<:221> Name/Key for SEQ 


ID# : 


6 


L 


2762 


M 


257 


W 


Feature 


value mis 


-spelled or invalid, 


<221> Name/Key for SEQ 


ID# : 


7 


L 


2768 


M 


259 


W 


Al lowed 


number of 


lines exceeded, <22 


3> Other Information: 






L 


3408 


M 


257 


W 


Feature 


value mis 


-spelled or invalid, 


<221> Name/Key for SEQ 


ID# : 


11 


L 


4386 


M 


257 


W 


Feature 


value mis 


-spelled or invalid, 


<221> Name/Key for SEQ 


ID# : 


13 


L 


4392 


M 


259 


W 


Allowed 


number of 


lines exceeded, <22 


3> Other Information: 






L 


4786 


M 


257 


w 


Feature 


value mis 


-spelled or invalid, 


<221> Name/Key for SEQ 


ID# : 


15 


L 


5392 


M 


257 


w 


Feature 


value mis 


-spelled or invalid, 


<221> Name/Key for SEQ 


ID# : 


17 


L 


5644 


M 


257 


w 


Feature 


value mis 


-spelled or invalid, 


<221> Name/Key for SEQ 


ID# : 


19 


T, 


5650 


M 


259 


w 


Allowed 


number of 


lines exceeded/ <22 


3> Other Information: 






L 


6030 


M 


257 


w 


Feature 


value mis 


-spelled or invalid, 


<221> Name/Key tor SEQ 


ID* : 


21 


L 


6036 


M 


259 


w 


Allowed 


number of 


lines exceeded, <22 


3> Other Information: 






L 


6216 


M 


257 


w 


Feature 


value mis 


-spelled or invalid, 


<221> Name/Key for SEQ 


ID# : 


23 


L 


6266 


M 


212 


E 


(34) Invalid or duplicate Sequence ID 


Number, SEQUENCE ID NOS 


: 23 


differs 


L 


6580 


M 


212 


E 


(34) Invalid or duplicate Sequence ID 


Number, SEQUENCE ID NOS 


: 25 


differs 
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<i io> 

< 1 20> 

< 1 30> 



< 1 < 0 > 

< 1 < 1 > 



Smith. John; Smithgcnc Inc. 
example of a Sequence Listing 
01 -00001 



PCT/CP98/0OOO1 
1998-12-31 



< 1 S0> 

< 1 bl > 



US OB/999,999 
iyy /-iu-ib 



< 1 G0> 



< 1 70> 



Patentln version 2.0 



< 2 1 0> 

< 2 1 1 > 

< 2 1 ? > 

< 2 1 3 > 



1 

309 
ON A 

Pel ramcc i um s p 



< 2 20> 

< 2 2 1 > 

< 2 2 2 > 

< 300> 

< 301 > 

< 30? > 

< 303 

< 304 

< 305 

< 306 

< 307 

< 3 00 
* 309 



CDS 

(2 7 9) . . . (3 89} 



Doc. Richard 

Isolation and Character i z a i ion of a Ccnc encoding a 

Protease from Paramecium r,p 

Journal of Genes 

1 

< 

1 - 7 

1988-06-3 1 
1 ? 3 < S G 

1 9 0 G - 0 6 - 3 1 



' c oo • i 

a g c i 9 t a g t c <n ccctgigi ccicucict <. t g y g < i t c i c a c c c c. g c i a a t c a g a t c t c 

agggagagtq Lcttgaccct cctcigccu tgcagcitca coggcaggco ggcaggcagc 

igatgtggca attgetggea gtgecacagg cttttcagcc aggcitaggg tgggttccgc 



GO 
1 20 

i no 



rgeggegegg cggcccctci cgcgctccic tcgcgccici cictcgctct ccictcgc 



?40 



PClfM/EXlSC/22/6 
Annex VII. page 30 



Appendix 3, page 2 

^cct 0 .u aggtgagcag gaggaggggg ^c^c atg g 1 1 tea atg tec age 296 

1 S 

ttg tct ttc aaa tgg cct gga ttt tgt ttg ttt gtt tgt teg ttc caa 3<< 

Leu Sdr Phc Lys Trp Pro Cly The Cys Leu Phc Va 1 Cys Leu Phc cin 

10 IS 20 



tgt ccc aaa gtc etc ccc tgt cac Lea tea ctg cag ccg aat ecu 
Cys Pro Lys Va 1 Leu Pro Cys Mis Scr Scr -Leu Cln Pro Asn. Leu 
25 30 35 



<210> 2 

<211> 37 - , 

< 2 1 2 > PRT 

< 2 1 3 > Paramecium sp. 

<<00> 2 , 

Met Vol Scr Met Phc Scr Leu Scr Phc Lys Trp Pro Cly Phc. Cys Leu 

1 5> 10 15 

Phc Vol Cys Leu Phc Cln Cys Pro Lys Vol Leu Pro Cys Mis Scr Scr 
20 2 5 30 

Leu Cln Pr o Asn Lgvi 
3 ^ 

< 2 1 0> 3 
<21 1 > 11 

< 2 1 2 > PRT 

< 2 1 3 > Artificial Sequence 
<220> 

< 2 2 3 > Designed peptide hosed on size and polarity to act as a 

linker between the alpha and beta chains of Protein XVZ 

<<00> 3 

Met vol Asn Leu Clu I'io Met Wis Thr Glu lie 

1 b 10 

< 2 1 0 > < 

< < 00 > < 

000 



389 



[Annex VIII follows| 



IU-qu.rcmcni$ for Appl.cai.o.n OC Oju 23 J U n^99E 



JM '»o K o»/»LVofi.cc*'co...'so.'0 6 /l99£/„ ccl>2S/pai ^ 



x ue lit- a 
tabl c . 



~he numeric identifier shall be used only in the Sequence 
Listing.^ The order and presentation of the items of information in the 
"Sequence Listing" shall conform to the arrangement given below. Each 
item of information shall begin on a new line and shall begin with the 
numeric identifier enclosed in angle brackets as shown. The submission 
of those items of information designated with an "M" is mandatory. The 
submission of those items of information designated with an "O" is 
optional. Numeric identifiers <110> through <170> shall only., be set 
forth at the beginning of the "Sequence Listing. " Tfie following table 
illustrates the numeric identifiers. 



Nume r i c 
Identi f 1 er 



De f ini 1 1 on 



Comments and 
forma t 



Mandatory (M) 
Optional ,(0) 



<110> 



Appl leant 



Preferably max . 

of 10 names; 

one name per line; 

preferable format: 

Su rnamc , Othc r f- 

Namcs and/or 

Initials 



M 



<120> 



Title of 
I nvent ion 



<130> 



file Reference 



Personal file 
r e f c rence 



M. when filed prior 
to assignment of 
appl. numbc r 



< 1 <0> 



Current Applica- Specify as: 
Lion Number US 07/999,999 or 

PCT/US96/99999 



if avai lab) 



< 1 4 1 > 



Current Tiling Specify as: yyyy-mm-dd 

Date 



M, if available 



< 1 S0> 



Prior Application Specify as: 
Number US 07/999, 999 or 

PCT/US96/99999 



M, if appl 1 cab 1 e 
l nc 1 ude priori ty 
documents under 
35 USC 119 and 
120 



< 1 b\ > 



Prior Application Specify as: yyyy-mm-dd M , if applicable 
Filing Da t c 



1 6 0 : 



Number of S L(j 11 > Count includes 



NO; 



total numbc r o f 
SKQ I [) NO:, 



< 1 70: 



Software 



Name of software used 
to create the 
Sequence Listing 



'? 1 0: 



SEC ID NO: II 



Response shall be an 
integer r cp r e 
s e n 1 1 ng the S ZQ 



t ( 1 s l d u e 



• rcntCMi* lot Al'pl^Ji.ons OC J^nv 



I 90S 



l.., r 



<212> 



Type 



Whether presented 
sequence mole- 
cule is DNA, 
RNA, or PRT 
(protein ) . If 
a nucleotide 
sequence con- 
tains both DNA 

and RNA frag- 
ments, the 

type shall be 

"DNA. " In ad- 

d i t i on , the 

combined DNA/ 

RNA molecule 

shall be further 

deser 1 bed 1 n 

the <220> to 

<223> feature 

section. 



<21 3> 



Organi sm 



Sci cnti f ic name, 
i.e. Gcnus/spccics 
Unknown or Artifi- 
cial Sequence. In 
addition, the 
"Unknown" or 
"Art 1 f l ci a 1 Se- 
quence" organisms 
shall be f u r t he r 
described in the 
<220> to <223> 
feature section. 



<220> 



FeaLurc 



Leave blank after 
<220>. <221-223> 
provide for a 
description of 
points of bio- 
logical signi- 
ficance in the 
sequence . 



M, under the 
foil owing cond 1 - 
t i ons : if " n , " 
"Xaa , " or a mod - 
lfied or unusual 
L - ami no acid or 
modified base was 
used in a se- 
quence; if ORGAN- 
ISM is "Arti f l - 
c l a 1 Sequence" o r 
"Un known " ; if 
mo 1 cculc is 
r omb l ncd DNA / flNA 



<22 1 



Name / K c y 



Provide appropriate 


K . under the 


f ol - 


identifier for 


lowing condit 


ions 


feature, pre- 


if " n , " "Xaa, 


o r 


ferably ( rom 


a mod l ( l cd or 


u n - 


W I PO S unda rd 


usual L-amino 




ST 2 *) (1 998) , 


acid or mod l f 


l ed 


Appe nd l x 2 , 


base was u s c d 


i n 


Tables S and 6 


a sequence 





first and last 

h h s e s / a m i no a c i d <- 



usual L a rr, i n o 
acid or modified 



<223> 



Other Infor- 
ma 1 1 on 



in feature 



Other relevant 

1 n forma 1 1 on ; 

four lines maximum 



base was used in 
a sequence 

M, under the fol- 
lowing conditions: 
if M n, " "Xaa , " or 
a modified or un- 
usual L-amino acid 
or modi f ied base 
was used in a 
sequence; if 
ORGANISM 
is "Artificial 
Sequence" or 
"Unknown"; ■ i-f 
molecule is com- 
bined DNA/RNA. 



<300> 



<301> 



Publ i ca Li on 
I n forma Lion 

Authors 



Lea vc blank 
after <300> 

Preferably max 
of Leu fidiuCu 
authors of publi- 
cation ; speci f y 
one name per line, 
preferable forma L : 
Surname, Other 
Name s a nd/ o r 
Initials 



0 



< 302 > 

< 30 3 > 

< 304 > 

< 305> 

< 3 0 G > 

< 307 > 



Title 
Journal 
Vo 1 ume 
Issue 
Pa<je s 
Da t c 



Journal date on which 
data published; 
specify as yyyy-mm- 
d d , KMM - y y y y or 

Sea son - y y y y 



O 
O 
O 
O 
O 



< 309 > 



Dal aba s c 
Access i on 
N ufni)c r 



Database Entry 
Da t c 



A { ( c s s ion n u rnbe i 
assigned by data- 
base including 
database name 

Dale of entry in 
database; specify 
as y y y y ~ mm - d d or 
MMM - yy y y 



r ar W , i e . - 
0 "/ / 9 9 q _ 9 9 9 



i.i.,, 



<3) 1 > 



<312> 



<31 3> 



Patent Filing 
Da tc 



Document filing 
date, for patent- 
type citations only; 
specify as yyyy-mm-dd 



Publication Date Document publication 
date, for 
pa ten t - t ypc 
citations only; 
specify as yyyy-mm-dd; 



Relevant 
Res i dues 



PROM (position) TO 
(pos i t ion I 



<400> Sequence SEQ ID NO should " M 

follow the 
numeric identifier 
and should appear 
on the line pre- 
ceding the actual * 
sequence 

5. Section 1.02< is revised to read as follows: 

1.024 form and format for nucleotide and/or amino acid sequence 
submissions in computer readable form. 



fa) The computer readable form required by 
following specifications: 



1 .02 1 { c ) sha 1 1 mccL the 



(1) The computer readable form shall contain a single "Sequence Listing" 
as either a diskette, scries of diskettes, or other permissible media 
outlined in paragraph (c) of this section. 

(2) The "Sequence Listijrig" in paragraph (a) (1) of this section shall be 
submitted in American Standard Code for Information Interchange (ASCII) 
text. No other formats shall be allowed. 

(3) The computer readable form may be created by any means, such as word 
processors, nuc 1 co t i de /ami no acid sequence editors or othet custom 
computer programs; however, it shall conform to all specifications 
detailed in this section. 

H) Pile compression is acceptable when using diskette media, so long as 
the compressed file is in a s c 1 f - c x t r a c t i ng format that will decompress 
on one of the systems described in paragraph (b) of this section. 

( 5 ) Page numbering shall not appear within the computer loadable form 
version of the "Sequence Listing" tile. 

(6) All computer readable forms shall have a label permanently affixed 
thereto on which has been hand-printed or typed: the name of the 
applicant, the title of the invention, the date on which the data were 
recorded on the computer readable form, the operating system used, a 
reference number, and an application sctial number and filing date, if 
k no wn . 



(?) Operating System MS-DOS. Unih or Macintosh 



